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Abstract 
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1 Introduction 



It is a known observation [flj] that a relationship exists between the codons and the physical- 
chemical properties (PCP) of the coded amino acids (a. a.). The observed pattern is read either 
as a relic of some kind of interaction between the a. a. and the nucleotides at an early stage of 
evolution or as the existence of a mechanism relating the properties of codons with those of a. a.. 
In particular it is observed that the relationship depends essentially on the nature of the second 
nucleotide in the codons and it holds when the second nucleotide is adenine (A), uracil (U) or 
cytosine (C), not when it is guanine (G). To our knowledge neither the anomalous behaviour of 
G nor the existence of a closest relationship between some of the a.a. is understood. It is the 
aim of this paper to provide an explanation of both these facts in the framework of the model 
of genetic code we have discussed in a previous paper p|, called crystal basis model, as it is 
heavily based on the mathematical structure of the modules of the irreducible representations 
(IR) of U q ^o[sl(2) © sl(2)], known in the mathematical literature as crystal basis f3j. After 
recalling briefly the model in Sect. 2, we derive in Sect. 3 a set of relations between the PCP of 
the amino acids, based on the content of the IRs of the dinucleotides and of the codons coding 
for the amino acids. Finally we compare our predictions with the experimental data. 



2 The Model 

The four nucleotides A, C, G, U are considered as basic states of the (~, ~) irreducible rep- 
resentation of the quantum enveloping algebra U q [sl{2) © si (2)] in the limit q — > 0, with the 
following assignment of quantum numbers: 

c = (+!,+§) u = (-!,+!) g = (+§,-!) a = (-£,4) (i) 

In this framework, the codons are built as composite states of the nucleotide states by tensoring 
three such (■§, 5) representations. Note that the crystal basis, which exists in the limit q —> 
of the g-deformed universal enveloping algebra U q (G) for any semi-simple Lie algebra G, is the 
only way to provide such composite states as pure states, and to ensure the existence of an 
order in the tensoring procedure. 

However, it is well-known (see Table [3]) that in a multiplet of codons relative to a specific 
amino acid, the first two bases constituent of a codon are "relatively stable", the degeneracy 
being mainly generated by the third nucleotide. Considering the tensor product 

(f , \) © (§, |) = (1, 1) © (1, 0) © (0, 1) © (0, 0) (2) 

we get, using Kashiwara's theorem ||, the following tableau: 

- sl(2) H (0,0) (CA) (1,0) 

i 

sl(2) v (0,1) I GU I (1.1) 




CG 


UG 


UA ) 


cc 


uc 


uu \ 


GC 


AC 


AU 


GG 


AG 


AA / 
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where the subscripts H (:= horizontal) and V (:= vertical) specify the two sl(2). 

From Table [l], the dinucleotide states formed by the first two nucleotides in a codon can 
be put in correspondence with quadruplets, doublets or singlets of codons relative to an amino 
acid, the sextets (resp. triplets) being viewed as the sum of a quadruplet and a doublet (resp. 
a doublet and a singlet). We define in our model the "charge" Q of a dinucleotide state by 

Q = J3,H + \C V (J 3 y + l)-\ (3) 

The dinucleotide states are then split into two octets with respect to the charge Q: the eight 
strong dinucleotides CC, GC, CG, GG, CU, GU, UC, AC associated to the quadruplets (as well 
as those included in the sextets) of codons satisfy Q > 0, while the eight weak dinucleotides 
AA, AU, UA, UU, UG, AG, CA, GA associated to the doublets (as well as those included in 
the triplets) and eventually to the singlets of codons satisfy Q < 0. Let us remark that by the 
change C <-> A and U <-> G, which is equivalent to the change of the sign of J^ t0l or to reflexion 
with respect to the diagonals of the eq. ([I]), the 8 strong dinucleotides are transformed into 
weak ones and vice-versa. Note that a first attempt to differentiate between strong and weak 
dinucleotides is contained in ref. ||. 

The irreducible representations of the tensor product of (~, |)® 3 as well as the correspon- 
dence codons/amino acids for the eukariotic code is reported in Table [|. The upper labels 
denote different IRs. 

3 Relationship between the PCP of Amino Acids 

We assume that some PCP of a given amino acid are related to the nature of the codons, in 
particular they depend on the following mathematical features, written in hierarchical order: 

1. the IR of the dinucleotide formed by the first two nucleotides; 

2. the sign of the charge Q, eq. @, on the dinucleotide state; 

3. the value of the third component of J^y inside a fixed IR for the dinucleotides; 

4. the upper label(s) of the codon IR(s); 

Not all the PCP are supposed to follow the scheme above; some of them are essentially given 
by the specific chemical structure of the amino acid itself. 

In the following, we analyze the PCP of the amino acids in the light of the dinucleotide 
content of the irreducible representations of eq. (Q). 

- representation (0,0) 

The codons of the form CAN (N = C, U, G, A) all belong to the IR (f , \) A and code for His 
and Gin, both being coded by doublets and differing by the value of J%y. Then we expect that 
the PCP of His and Gin are very close. 
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- representation (1,0) 

We analyse now the codons built up by the dinucleotide IR (1,0). i.e. CG (Q > 0), UG, 
UA (both Q < 0). The codons CGS (S = C, G), resp. CGW (W = U, A), belonging to IR 
(3/2, 1/2) 2 , resp. (1/2, 1/2) 2 , all code for Arg, so we do not have any relation. The codons UGS, 
resp. UGW, belonging to IR (3/2, 1/2) 2 , resp. (1/2, 1/2) 2 , code for Cys (which is a doublet) 
and Trp (singlet), resp. the other Cys and Ter (triplets). So we expect the PCP of Cys not 
very different from those of Trp. The codons UAN, belonging to the IR (3/2, 1/2) 2 , code for 
the Tyr and Ter. So we expect some affinity between the a. a. coded by UGN and UAN, in 
particular between Cys and Tyr both being coded by doublets. 

- representation (0,1) 

The codons built up by the dinucleotide IR (0, 1) are CU, GU (both Q > 0) and GA (Q < 0). 
The codons CUY and GUY (Y = C, U), resp. CUR and GUR (R = G, A), belonging to IR 
(1/2, 3/2) 2 , resp. (1/2, 1/2) 3 , code for Leu and Val. Therefore we do not have any relation 
between a. a. coded by the same dinucleotide, but we expect that the PCP of Leu and Val are 
close since CU and GU both belong to the same IR and are both strong. The codons GAN 
belong to the IR (1/2, 3/2) 2 and they code Asp and Glu (both doublets). Then we expect the 
PCP of Asp and Glu to be very close. 

- representation (1,1) 

The dinucleotide IR (1, 1) contains five states with Q > (CC, UC, GC, AC, GG). The codons 
CCN , resp. UCN, belong to four different IRs and code for Pro (quartet), resp. Ser (sextets). 
We expect an affinity between the PCP of these a.a.. The codons GCN , resp. ACN, belong to 
four different IRs and code for Ala (quartet), resp. Thr (quartets). We expect a strong affinity 
between the PCP of Ala and Thr. The codons GGN belong to two different IRs and code for 
Gly, so we expect an affinity of PCP of Gly with those of Pro, Ser, Ala, Thr. Now let us look 
at the four states with Q < (UU, AU, AG, AA). The codons UUN belong to two different 
IRs and code for Leu, the doublet subpart of the sextet, and for Phe (doublet). An affinity 
is expected between the PCP of these two a.a.. The codons AUN belong to two different IRs 
and code lie (triplet) and Met (singlet) and, in fact, the values of PCP of these two 
not very different. The codons AGN belong to two different IRs and code for Ser and Arg, the 
doublet subpart of the sextet, so an affinity between the PCP of these codons is expected. The 
codons AAN belong to the same IR (3/2,3/2) and code for Asn and Lys, so the values of the 
PCP of these Ob. Ob. should be close. 

Note that for the three sextets (Arg, Leu, Ser) the quartet (doublet) subpart is coded by a 
codon with a strong (weak) dinucleotide. 

In conclusion we predict the following relations between the values of the PCP of the amino 
acids (= means strong affinity, ~ affinity, ~ weak affinity): 

1. His ^ Gin 

2. Asp ^ Glu 
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3. Asn = Lys ~ Arg, Ser 

4. Cys = Tyr « Trp 

5. Leu = Val 

6. Pro = Ser « Gly 

7. Ala = Thr « Gly Pro, Ser 

8. lie = Met » Phe 

4 Discussion 

We have compared our theoretical previsions with 10 physical-chemical properties: 

1. the Chou-Fasman conformational parameters P a , Pp and P T which gives a measure of 
the probability of the a. a. to form respectively a helix, a sheet and a turn. However it has 
been suggested in that the sum P a + P/3 is a more appropriate parameter to characterize 
the generic structure forming potential while the difference P a — Pp is a more appropriate 
parameter for the helix forming potential, which is a quantity more depending on the 
particular a.a.. So we compare with P a + Pp and P r ; 

2. the Grantham polarity Pq 0; 

3. the relative hydrophilicity Rf as computed by Weber and Lacey M; 

4. the thermodynamic activation parameters at 298 K: AH (enthalpy, in kJ/mol), AG 
(free energy, in kJ/mol) and AS (entropy, in J/mole/K) as obtained by Siemion and 
Stefanowicz ||; 

5. the negative of the logarithm of the dissociation constants at 298 K: pK a for the a-COOH 
group and pK b for the a-NHg group |10|| ; 



6. the isoelectronic point pi [|T(|, i.e. the pH value at which no electrophoresis occurs. 



The comparison between the theoretical relations and the experimental values shows (see Tables 
|and |): 

1. His = Gin - The agreement, except for pi, is very good. 

2. Asp = Glu - The agreement, except for P T , is very good. 

3. Asn = Lys ~ Arg, Ser - The agreement, except for pi and P T is very good. The comparison 
with the values of PCP of Ser and Arg is satisfactory. 
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4. Cys = Tyr « Trp - Except for Rf, the agreement between the first two a.a. is very good, 
while with Trp is satisfactory. 

5. Leu = Val - The agreement is very good. 

6. Pro = Ser « Gly - The agreement is very good, except for P a + Pp and AH, and with 
Gly more than satisfactory. 

7. Ala = Thr » Gly, Pro, Ser - The agreement is very good between the first two a.a. except 
for P T and satisfactory with the others except for the conformational parameters. 

8. He = Met ~ Phe - The agreement is very good between the first two a.a. and satisfactory 
with Phe. 

In order to have a more quantitative evaluation of the agreement between the data and the the- 
oretical model we compute the mean value and the standard deviation of the whole population 
(i.e. the twenty amino-acids) for each generic PCP x 



x 



n 

n ^ 



Xi and cr,, 



\ 



n 

n ^ 



X j X J 



(4) 



in 3 different cases: 

1. considering all the amino acids, i.e. summing over the whole set of amino acids (n = 20); 

2. considering the amino acids coded by the codons with the same second nucleotide (n = 
4 for C, n = 5 for U and G, n = 7 for A); 

3. considering the couples (n = 2) of amino acids given by our 8 relations. 
The results are reported in Tables § to £|. 

As an estimate of the accurary of our predictions, we compute the sum of the adimensional 
quantities given for any PCP by the ratios a x /x of the standard deviation to the average value, 
in the three different cases considered above. 



all 1.82 


A 1.82 


C 1.13 


G 1.82 


U 0.70 


Gin/His 
0.35 


Asp/Glu 
0.63 


Asn/Lys 
0.72 


Cys/Tyr 
0.80 


Pro/Ser 
0.77 


Ala/Thr 
0.48 


lie/Met 
0.39 


Leu/ Val 
0.38 



Values of T] = ^2<j x /x 

Looking to the table above, we remark that the characterization of the PCP from the nature of 
the second nucleotide in the codons is, except for U, indeed not really discriminatory. On the 
contrary, the relations derived by our model reduces dramatically the values of our estimate 
parameter rj = Ylii a x/'^- 
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From Table § we note that the values of the thermodynamical quantities for some amino 
acids are not yet measured. Our relations predict that the values of AH and —AS for His 
should be around 60 kJ/mol and 120 kJ/mol/K. For the values of Asp and Glu coded by the 
unique weak nucleotide GA in the IR (0,1), we do not have any available data to compare with. 
However, making the further assumption, supported by an inspection of the available data, that 
the values of the thermodynamical quantities depend slightly on the nature of the charge, we 
predict that the values of these quantities for Asp and Glu should not be very dissimilar from 
the values of the analogous quantities for Leu and Val, coded by the strong dinucleotide CU 
an GU in the IR (0,1). So we predict that for Asp and Glu, one should find AH w 60 kJ/mol, 
—AS ss 135 kJ/mol/K and AG P3 100 kJ/mol. 



5 Conlusion 

In conclusion, the values of PCP show, with a few exceptions, a pattern of correlations which 
is expected from the assumptions done in the crystal basis model. Let us emphasize that the 
assignment of the value of the IRs both for the dinucleotide and for the trinucleotide states or 
codons is a straightforward consequence of our model. The remarked property that the a.a. 
coded by codons whose second nucleotide is G do not share similarity in the properties of PCP 
with other a.a. does find an explication in the model as it is immediate to verify, looking to the 
table of the tensor product of two (|, |) that that are no two states with G in second position 
which share simultaneously the properties of belonging to the same IR and being characterized 
by the same value of Q. 

A final remark: it has beeen sugested that the physico-chemical properties of the amino 
acids have played a fundamental role in the evolution and the organization of the genetic code. 
In this framework, a meaure of the relative distances between amino acids has been defined, see 



TTJ and references therein. The picture which emerges in |TTJ has some striking confirmations 
with the predictions of our model. So we argue that the assignment of codons in multiplets, 
corresponding to irreducible representations, may have a deep connection with the evolutionary 
organization. 

Acknowledgments: We are grateful to M.L. Chiusano for pointing us the interest to 
analyse the physical-chemical properties in the light of the crystal basis model and for providing 
us literature on the subject. Partially supported by MURST (Italy) and MAE (France) in the 
framework of french-italian collaboration Galileo. 
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Table 1: The eukariotic code. The upper label denotes different irreducible representations. 



codon a. a. 


Jh Jv 


codon a. a. 


Jh Jv 


CCC Pro 
CCU Pro 
CCC Pro 
CCA Pro 


3/2 3/2 
(1/2 3/2) 1 
(3/2 1/2) 1 
(1/2 1/2) 1 


UCC Ser 
UCU Ser 
UCG Ser 
UCA Ser 


3/2 3/2 
(1/2 3/2) 1 
(3/2 1/2) 1 
(1/2 1/2) 1 


CUC Leu 
CUU Leu 
CUC Leu 
CUA Leu 


(1/2 3/2)* 
(1/2 3/2) 2 
(1/2 1/2) 3 
(1/2 1/2) 3 


UUC Phc 
UUU Phc 
UUG Leu 
UUA Leu 


3/2 3/2 
3/2 3/2 
(3/2 1/2) 1 
(3/2 1/2) 1 


CGC Arg 
CGU Arg 
CGG Arg 
CGA Arg 


(3/2 1/2)* 
(1/2 1/2) 2 
(3/2 1/2) 2 
(1/2 1/2) 2 


UGC Cys 
UGU Cys 
UGG Trp 
UGA Ter 


(3/2 1/2) 2 
(1/2 1/2) 2 
(3/2 1/2) 2 
(1/2 1/2) 2 


CAC His 
CAU His 
CAG Gin 
CAA Gin 


(1/2 1/2) 4 
(1/2 1/2) 4 
(1/2 1/2) 4 
(1/2 1/2) 4 


UAC Tyr 
UAU Tyr 
UAG Ter 
UAA Ter 


(3/2 1/2) 2 
(3/2 1/2) 2 
(3/2 1/2) 2 
(3/2 1/2) 2 


GCC Ala 
GCU Ala 
GCG Ala 
GCA Ala 


3/2 3/2 
(1/2 3/2) 1 
(3/2 1/2) 1 
(1/2 1/2) 1 


ACC Thr 
ACU Thr 
ACG Thr 
ACA Thr 


3/2 3/2 
(1/2 3/2) 1 
(3/2 1/2) 1 
(1/2 1/2) 1 


GUC Val 
GUU Val 
GUG Val 
GUA Val 


(1/2 3/2) 2 
(1/2 3/2) 2 
(1/2 1/2) 3 
(1/2 1/2) 3 


AUC He 
AUU He 
AUG Met 
AUA He 


3/2 3/2 
3/2 3/2 
(3/2 1/2) 1 
(3/2 1/2) 1 


GGC Gly 
GGU Gly 
GGG Gly 
GGA Gly 


3/2 3/2 
(1/2 3/2) 1 
3/2 3/2 
(1/2 3/2) 1 


AGC Ser 
AGU Ser 
AGG Arg 
AGA Arg 


3/2 3/2 
(1/2 3/2) 1 
3/2 3/2 
(1/2 3/2) 1 


GAC Asp 
GAU Asp 
GAG Glu 
GAA Glu 


(1/2 3/2) 2 
(1/2 3/2) 2 
(1/2 3/2) 2 
(1/2 3/2) 2 


AAC Asn 
AAU Asn 
AAG Lys 
AAA Lys 


3/2 3/2 
3/2 3/2 
3/2 3/2 
3/2 3/2 
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Amino acid 


p j- P„ 

r a T 


p 

r T 


p„ 


H f 




A Q 






pKb 




Ala 


2.25 


0.66 


8.09 


0.89 


50.7 


147.8 


94.66 


2.34 


9.69 


6.00 


Arg 


1.91 


0.95 


10.50 


0.88 


54.8 


143.2 


97.5 


2.17 


9.04 


11.15 


Asn 


1.60 


1.56 


11.5 


0.89 


55 


135.8 


95.51 


2.02 


8.80 


5.41 


Asp 


1.55 


1.46 


13 


0.87 








1.88 


9.60 


2.77 


Cys 


1.89 


1.19 


5.5 


0.85 


43.9 


163.2 


92.89 


1.96 


10.28 


5.02 


Gin 


2.21 


0.98 


10.5 


0.82 


61.1 


121.8 


97.30 


2.17 


9.13 


5.65 


Glu 


1.88 


0.74 


12.2 


0.84 


— 


— 


— 


2.19 


9.67 


3.22 


Gly 


1.32 


1.56 


9 


0.92 


49.0 


140.6 


91.08 


2.34 


9.60 


5.97 


His 


1.87 


0.95 


10.4 


0.83 


— 


— 


94.59 


1.82 


9.17 


7.47 


He 


2.68 


0.47 


5.2 


0.76 


57.3 


152.3 


102.6 


2.36 


9.60 


5.94 


Leu 


2.51 


0.59 


4.9 


0.73 


59.4 


128.1 


97.70 


2.36 


9.60 


5.98 


Lys 


i on 
i.yu 


1 m 

l.Ul 


11. o 


n Q7 
u.y t 


V7 8 


i fi 


Q8 1 K 
yo. ID 


9 18 
Z.lo 


8 OA 

o.yo 


y .oy 


ivieL 


Z.OU 


n fin 


^ 7 


n 7A 


^8 9 


1 98 ^ 


Qfi fi^ 


9 98 


Q 91 




r ne 


Z.Ol 


n fin 


t\ 9 


n ^9 


4o.o 


1 ^fi 1 
10D. 1 


Q/1 7K 


1 89 


y.io 


K /I8 


Pro 


1.12 


1.52 


8 


0.82 


50.7 


164.5 


99.49 


1.99 


10.60 


6.30 


Ser 


1.52 


1.43 


9.19 


0.96 


36.0 


180 


89.98 


2.21 


9.15 


5.68 


Thr 


2.02 


0.96 


8.59 


0.92 


53.4 


136.0 


93.98 


2.09 


9.10 


5.64 


Trp 


2.45 


0.96 


5.4 


0.2 


51.1 


151.5 


96.25 


2.83 


9.39 


5.89 


Tyr 


2.16 


1.14 


6.2 


0.49 


54.4 


139.0 


95.68 


2.20 


9.11 


5.66 


Val 


2.76 


0.50 


5.9 


0.85 


60.2 


139.0 


101.7 


2.32 


9.62 


5.96 


X 


2.03 


0.99 


8.31 


0.79 


53.02 


144.88 


96.14 


2.18 


9.42 


6.03 


O x 


0.45 


0.36 


2.61 


0.18 


6.23 


15.57 


3.16 


0.23 


0.43 


1.76 


a x jx 


22 % 


36 % 


31 % 


23 % 


12 % 


10 % 


3 % 


10 % 


5 % 


29 % 



Table 2: Table of amino-acid properties 







P a + P[3 


Pr 


Pg 


Rf 


AH 


-AS 


AG 




pK b 


pj 




X 


1.88 


1.12 


10.73 


0.82 


57.08 


133.05 


96.25 


2.07 


9.20 


5.68 


A 


Ox 


0.23 


0.27 


2.03 


0.14 


2.65 


6.63 


1.29 


0.15 


0.30 


2.17 




a x /x 


12 % 


24 % 


19 % 


17 % 


5 % 


5 % 


1 % 


7 % 


3 % 


38 % 




X 


1.73 


1.14 


8.47 


0.90 


47.70 


157.08 


94.53 


2.16 


9.64 


5.90 


C 




0.44 


0.35 


0.47 


0.05 


6.84 


16.66 


3.38 


0.13 


0.60 


0.28 




(T x /X 


25 % 


31 % 


6 % 


6 % 


14 % 


11 % 


4 % 


6 % 


6 % 


5 % 




X 


1.82 


1.22 


7.92 


0.76 


46.96 


155.70 


93.54 


2.30 


9.49 


6.74 


G 


O x 


0.39 


0.25 


2.08 


0.28 


6.52 


14.48 


2.90 


0.29 


0.44 


2.23 




a x /x 


21 % 


20 % 


26 % 


37 % 


14 % 


9 % 


3 % 


13 % 


5 % 


33 % 




X 


2.59 


0.55 


5.38 


0.72 


56.68 


140.80 


98.68 


2.23 


9.43 


5.82 


U 


O x 


0.11 


0.06 


0.37 


0.11 


4.31 


11.68 


3.00 


0.20 


0.22 


0.19 




Ox/X 


4 % 


10 % 


7% 


15 % 


8 % 


8 % 


3 % 


9 % 


2 % 


3 % 



Table 3: PCP of amino-acids coded by codons with same second nucleotide (written at the left) 
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Pa + Pfi 


fV 


Pg 




A T"7" 


— AS 


AG 


77- 

pK a 


TV" 


T 

P I 




X 


O f»/1 

Z.U4 


u.y f 


1U.40 


U.oo 


^1 1 n 
01. 1U 


IZl.oU 


nc ac 

yo.yo 


o nn 
z.UU 


o i c; 

y.io 


O.OD 


Gin/ His 


O x 


AIT 

0.17 


A A1 

0.01 


n a c 

0.05 


A A1 

0.01 






1.3b 


AIT 

0.17 


A AO 

0.02 


A A 1 

0.91 




a x /x 


8 % 


2 % 


% 


1 % 






1 7o 


9 % 


7o 


14 % 




X 


1 79 


1 1 


1Z.UU 


u.ou 








9 04 




o.uu 


Asp/Glu 


O x 


0.17 


0.36 


0.40 


0.02 








0.16 


0.04 


0.22 




a x /x 


10 % 


33 % 


3 % 


2 % 








8 % 


% 


8 % 




X 


1.75 


1.29 


11.40 


0.93 


56.40 


135.70 


96.84 


2.10 


8.88 


7.50 


Asn/Lys 




0.15 


0.28 


0.10 


0.04 


1.40 


0.10 


1.32 


0.08 


0.08 


2.09 




a x /x 


9% 


21 % 


1 % 


4 % 


2 % 


% 


1 % 


4 % 


1 % 


28 % 




X 


2.03 


1.17 


5.85 


0.67 


49.15 


151.10 


94.29 


2.08 


9.70 


5.34 


Cys/Tyr 


O x 


0.14 


0.02 


0.35 


0.18 


5.25 


12.10 


1.40 


0.12 


0.58 


0.32 




a x /x 


7% 


2 % 


6 % 


27 % 


11 % 


8 % 


1 % 


6 % 


6 % 


6 % 




X 


1.32 


1.48 


8.60 


0.89 


43.35 


172.25 


94.74 


2.10 


9.88 


5.99 


Pro/Ser 


O x 


0.20 


0.04 


0.60 


0.07 


7.35 


7.75 


4.76 


0.11 


0.73 


0.31 




Ox/X 


15 % 


3 % 


7% 


8 % 


17 % 


4 % 


5 % 


5 % 


7 % 


5 % 




X 


2.14 


0.81 


8.34 


0.91 


52.05 


141.9 


94.32 


2.22 


9.40 


5.82 


Ala/Thr 


O x 


0.12 


0.15 


0.25 


0.02 


1.35 


5.90 


0.34 


0.13 


0.29 


0.18 




Ox/X 


5 % 


19 % 


3 % 


2 % 


3 % 


4 % 


% 


6 % 


3 % 


3 % 




X 


2.59 


0.54 


5.45 


0.75 


57.75 


140.40 


99.63 


2.32 


9.41 


5.84 


lie/Met 


O x 


0.09 


0.07 


0.25 


0.01 


0.45 


11.90 


2.97 


0.04 


0.19 


0.10 




Ox/X 


3% 


12 % 


5 % 


1 % 


1 % 


8 % 


3 % 


2 % 


2 % 


2 % 




X 


2.64 


0.55 


5.40 


0.79 


59.80 


133.55 


99.70 


2.34 


9.61 


5.97 


Leu/Val 


O x 


0.12 


0.05 


0.50 


0.06 


0.40 


5.45 


2.00 


0.02 


0.01 


0.01 




Ox/X 


5 % 


8 % 


9 % 


8 % 


1 % 


4% 


2 % 


1 % 


% 


% 



Table 4: Table of amino-acid doublet properties 
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